AITopics | medical problem

Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks

Taylor, Niall, Ghose, Upamanyu, Rohanian, Omid, Nouriborji, Mohammadmahdi, Kormilitzin, Andrey, Clifton, David, Nevado-Holgado, Alejo

arXiv.org Artificial Intelligence, Feb-16-2024

The entry of large language models (LLMs) into research and commercial spaces has led to a trend of ever-larger models, with initial promises of generalisability, followed by a widespread desire to downsize and create specialised models without the need for complete fine-tuning, using Parameter Efficient Fine-tuning (PEFT) methods. We present an investigation into the suitability of different PEFT methods for clinical decision-making tasks, across a range of model sizes, including extremely small models with as few as 25 million parameters. Our analysis shows that the performance of most PEFT approaches varies significantly from one task to another, with the exception of LoRA, which maintains relatively high performance across all model sizes and tasks, typically approaching or matching full fine-tuned performance. The effectiveness of PEFT methods in the clinical domain is evident, particularly for specialised models which can operate on low-cost, in-house computing infrastructure. The advantages of these models, in terms of speed and reduced training costs, dramatically outweigh any performance gain from large foundation LLMs. Furthermore, we highlight how domain-specific pre-training interacts with PEFT methods and model size, and discuss how these factors interplay to provide the best efficiency-performance trade-off. Full code available at: tbd.

arxiv, llm, peft method, (15 more...)

arXiv.org Artificial Intelligence

2402.10597

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
Asia > Middle East > Israel (0.04)
Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area (0.93)
Health & Medicine > Diagnostic Medicine (0.93)
Health & Medicine > Health Care Providers & Services (0.68)
Health & Medicine > Health Care Technology > Medical Record (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing

Rohanian, Omid, Nouriborji, Mohammadmahdi, Clifton, David A.

arXiv.org Artificial Intelligence, Dec-31-2023

Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evolving. In this context, our study investigates the potential of instruction tuning for biomedical language processing, applying this technique to two general LLMs of substantial scale. We present a comprehensive, instruction-based model trained on a dataset that consists of approximately 200,000 instruction-focused samples. This dataset represents a carefully curated compilation of existing data, meticulously adapted and reformatted to align with the specific requirements of our instruction-based tasks. This initiative represents an important step in utilising such models to achieve results on par with specialised encoder-only models like BioBERT and BioClinicalBERT for various classical biomedical NLP tasks. Our work includes an analysis of the dataset's composition and its impact on model performance, providing insights into the intricacies of instruction tuning. By sharing our code, models, and the distinctively assembled instruction-based dataset, we seek to encourage ongoing research and development in this area.

dataset, instruction, medical problem, (10 more...)

arXiv.org Artificial Intelligence

2401.00579

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (0.94)
Health & Medicine > Diagnostic Medicine (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.75)

Are Large Language Models Ready for Healthcare? A Comparative Study on Clinical Language Understanding

Wang, Yuqing, Zhao, Yun, Petzold, Linda

arXiv.org Artificial Intelligence, Jul-30-2023

Large language models (LLMs) have made significant progress in various domains, including healthcare. However, the specialized nature of clinical language understanding tasks presents unique challenges and limitations that warrant further investigation. In this study, we conduct a comprehensive evaluation of state-of-the-art LLMs, namely GPT-3.5, GPT-4, and Bard, within the realm of clinical language understanding tasks. These tasks span a diverse range, including named entity recognition, relation extraction, natural language inference, semantic textual similarity, document classification, and question-answering. We also introduce a novel prompting strategy, self-questioning prompting (SQP), tailored to enhance LLMs' performance by eliciting informative questions and answers pertinent to the clinical scenarios at hand. Our evaluation underscores the significance of task-specific learning strategies and prompting techniques for improving LLMs' effectiveness in healthcare-related tasks. Additionally, our in-depth error analysis on the challenging relation extraction task offers valuable insights into error distribution and potential avenues for improvement using SQP. Our study sheds light on the practical implications of employing LLMs in the specialized domain of healthcare, serving as a foundation for future research and the development of potential applications in healthcare settings.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2304.05368

Country:

North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Zero-shot Clinical Entity Recognition using ChatGPT

Hu, Yan, Ameer, Iqra, Zuo, Xu, Peng, Xueqing, Zhou, Yujia, Li, Zehan, Li, Yiming, Li, Jianfu, Jiang, Xiaoqian, Xu, Hua

arXiv.org Artificial Intelligence, May-15-2023

We noticed that ChatGPT struggled to extract co-reference entities like "her medications" or "her symptoms", which should be annotated in accordance with the 2010 i2b2 annotation guidelines, for coreference identification purposes. After we removed those co-reference entities from the gold standard and re-evaluated the performance of both ChatGPT and GPT-3, we observed modest increases in performance, with ChatGPT achieving an F1 score of 0.628 using Prompt-2 and GPT-3 attaining an F1 score of 0.500 under the relaxed-match criteria. Moreover, we observed a significant degree of randomness in ChatGPT's output. Even when presented with the same prompt and the same input text, it sometimes generated responses with considerable differences in format and content. This phenomenon was particularly prevalent when the input note was lengthy, despite our efforts to minimize input sequence length by limiting it to the HPI section. We anticipate this issue will be addressed when GPT-4 allows much longer text. Although it is not clear whether clinical corpora (and what types of clinical corpora) are used in training ChatGPT, ChatGPT has demonstrated its understanding of medical text to a certain degree. We believe fine-tuning ChatGPT with domain-specific corpora, assuming OpenAI will provide such an API, will further improve its performance on clinical NLP tasks such as NER in the zero-shot fashion.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2303.16416

Country: North America > United States > Texas (0.04)

Genre: Research Report > New Finding (0.95)

Industry:

Health & Medicine > Health Care Technology (0.70)
Health & Medicine > Diagnostic Medicine (0.68)
Health & Medicine > Health Care Providers & Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Lightweight Transformers for Clinical Natural Language Processing

Rohanian, Omid, Nouriborji, Mohammadmahdi, Jauncey, Hannah, Kouchaki, Samaneh, Group, ISARIC Clinical Characterisation, Clifton, Lei, Merson, Laura, Clifton, David A.

arXiv.org Artificial Intelligence, Feb-9-2023

Specialised pre-trained language models are becoming more frequent in NLP since they can potentially outperform models trained on generic texts. BioBERT (Sanh et al., 2019) and BioClinicalBERT (Alsentzer et al., 2019) are two examples of such models that have shown promise in medical NLP tasks. Many of these models are overparametrised and resource-intensive, but thanks to techniques like Knowledge Distillation (KD), it is possible to create smaller versions that perform almost as well as their larger counterparts. In this work, we specifically focus on the development of compact language models for processing clinical texts (i.e. ...). We developed a number of efficient lightweight clinical transformers using knowledge distillation and continual learning, with the number of parameters ranging from 15 million to 65 million. These models performed comparably to larger models such as BioBERT and BioClinicalBERT and significantly outperformed other compact models trained on general or biomedical data. Our extensive evaluation was done across several standard datasets and covered a wide range of clinical text-mining tasks, including Natural Language Inference, Relation Extraction, Named Entity Recognition, and Sequence Classification. To our knowledge, this is the first comprehensive study specifically focused on creating efficient and compact transformers for clinical NLP tasks. The models and code used in this study can be found on our Huggingface profile at https://huggingface.co/nlpie and Github page at https://github.com/

Large language models pre-trained on generic texts serve as the foundation upon which most state-of-the-art NLP models are built. There is ample evidence that, for certain domains and downstream tasks, models that are pre-trained on specialised data outperform baselines that have only relied on generic texts (Sanh et al., 2019; Alsentzer et al., 2019; Beltagy et al., 2019; Nguyen et al., 2020; Chalkidis et al., 2020).

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2302.04725

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Norway (0.04)
(20 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Therapeutic Area > Hematology (1.00)
(11 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.86)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.66)

Knowledge Base Completion for Constructing Problem-Oriented Medical Records

Mullenbach, James, Swartz, Jordan, McKelvey, T. Greg, Dai, Hui, Sontag, David

arXiv.org Machine Learning, Aug-7-2020

Both electronic health records and personal health records are typically organized by data type, with medical problems, medications, procedures, and laboratory results chronologically sorted in separate areas of the chart. As a result, it can be difficult to find all of the relevant information for answering a clinical question about a given medical problem. A promising alternative is to instead organize by problems, with related medications, procedures, and other pertinent information all grouped together. A recent effort by Buchanan (2017) manually defined, through expert consensus, 11 medical problems and the relevant labs and medications for each. We show how to use machine learning on electronic health records to instead automatically construct these problem-based groupings of relevant medications, procedures, and laboratory tests. We formulate the learning task as one of knowledge base completion, and annotate a dataset that expands the set of problems from 11 to 32. We develop a model architecture that exploits both pre-trained concept embeddings and usage data relating the concepts contained in a longitudinal dataset from a large health system. We evaluate our algorithms' ability to suggest relevant medications, procedures, and lab tests, and find that the approach provides feasible suggestions even for problems that are hidden during training. The dataset, along with code to reproduce our results, is available at https://github.com/asappresearch/kbc-pomr.

artificial intelligence, knowledge base completion, machine learning, (14 more...)

arXiv.org Machine Learning

2004.12905

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts (0.04)
Europe > Middle East > Malta > Port Region > Southern Harbour District > Valletta (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Health & Medicine > Health Care Technology > Medical Record (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

What if AI in health care is the next asbestos? - STAT

#artificialintelligence, Jun-20-2019, 20:19:43 GMT

Artificial intelligence is often hailed as a great catalyst of medical innovation, a way to find cures to diseases that have confounded doctors and make health care more efficient, personalized, and accessible. But what if it turns out to be poison? Jonathan Zittrain, a Harvard Law School professor, posed that question during a conference in Boston Tuesday that examined the use of AI to accelerate the delivery of precision medicine to the masses. "I think of machine learning kind of as asbestos," he said. "It turns out that it's all over the place, even though at no point did you explicitly install it, and it has possibly some latent bad effects that you might regret later, after it's already too hard to get it all out."

artificial intelligence, health care, zittrain, (13 more...)

#artificialintelligence

Country:

North America > United States > North Carolina (0.05)
North America > United States > New York (0.05)
Asia > Afghanistan (0.05)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Education > Educational Setting > Higher Education (0.55)
Education > Curriculum > Subject-Specific Education (0.55)

Technology: Information Technology > Artificial Intelligence > Applied AI (1.00)

Artificial Intelligence In Healthcare: Separating Reality From Hype - The Art of Transforming Network into Networking

#artificialintelligence, Aug-22-2018, 11:18:07 GMT

It's impossible to read about the future of healthcare without encountering two pixelated vowels that, together, represent the hopes and fears of an industry seeking more intelligent solutions. Though the field of artificial intelligence (AI) has been around since 1956, it has made precious few contributions to medical practice. Only recently has the hype of machine-based learning begun to merge with reality. Confusion surrounding AI, including its applications in healthcare and even its definition, remains widespread in popular media. Today, AI is shorthand for any task a computer can perform just as well as, if not better than, humans.

artificial intelligence, deep learning, machine learning, (15 more...)

#artificialintelligence

Country: North America > United States > California > San Francisco County > San Francisco (0.05)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Diagnostic Medicine (0.70)
Leisure & Entertainment > Games > Chess (0.70)

Technology:

Information Technology > Artificial Intelligence > Applied AI (0.83)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

A hybrid deep learning approach for medical relation extraction

Chikka, Veera Raghavendra, Karlapalem, Kamalakar

arXiv.org Machine Learning, Jun-26-2018

Mining relationships between treatment(s) and medical problem(s) is vital in the biomedical domain. This helps in various applications, such as decision support systems, safety surveillance, and new treatment discovery. We propose a deep learning approach that utilizes both word-level and sentence-level representations to extract the relationships between treatment and problem. While deep learning techniques demand a large amount of data for training, we make use of a rule-based system, particularly for relationship classes with fewer samples. Our final relations are derived by combining the results from the deep learning and rule-based models. Our system achieved promising performance on the relationship classes of the i2b2 2010 relation extraction task.

artificial intelligence, machine learning, relation, (17 more...)

arXiv.org Machine Learning

1806.11189

Country:

Europe > United Kingdom > England > Greater London > London (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.05)
Asia > India > Telangana > Hyderabad (0.05)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Future Vision

AITopics Original Links, Jan-18-2017, 10:21:38 GMT

This report summarises views expressed in the second of three BCS Thought Leadership Debates run in association with the UK Office of Science and Technology as part of the Cognitive Systems Programme of the UK Foresight Programme, which explores areas for advanced research. The event was on 25 November 2004 at the Institute of Directors in London. Two expert speakers stimulated debate with short talks and then the 30 participants discussed the topic in small groups over dinner. The participants were mainly senior computer scientists and neuroscientists from UK universities, with others from specialist companies and research organisations. After dinner each table reported back to the entire gathering.

artificial intelligence, computer science, image analysis, (10 more...)

AITopics Original Links

Country:

Europe > United Kingdom (0.25)
North America > United States (0.05)

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.55)
Health & Medicine > Diagnostic Medicine > Imaging (0.49)

Technology:

Information Technology > Artificial Intelligence > Vision (0.55)
Information Technology > Artificial Intelligence > Cognitive Science (0.38)
